Feature space generalized variable parameter HMMs for noise robust recognition
Authors
Abstract
Handling variable ambient noise is a challenging task for automatic speech recognition (ASR) systems. Common remedies include multi-style training on speech data collected in diverse noise environments, noise adaptive training, and uncertainty decoding. An alternative approach is to explicitly approximate the continuous trajectory of Gaussian component parameters, or of model space linear transform parameters, as the noise varies, for example by using generalized variable parameter HMMs (GVP-HMMs). To reduce the computational cost that conventional GVP-HMMs incur when model parameters must be updated as the noise condition varies, this paper investigates a novel and more efficient extension of GVP-HMMs that can also model the trajectories of feature space linear transforms. Significant relative error rate reductions of 9.3% and 18.5% were obtained over the multi-style training baseline system on Aurora 2 and on a medium vocabulary Mandarin Chinese speech recognition task, respectively.
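To make the idea of a feature space transform trajectory concrete, the following is a minimal sketch, not the paper's method. It assumes the affine transform W(v) = [A(v) b(v)] is a polynomial in a scalar noise descriptor v (taken here to be the SNR in dB), and it fits the polynomial coefficients by ordinary least squares from transforms already estimated at a few known SNRs. The abstract does not specify the estimation procedure, the polynomial degree, or the noise descriptor, so those choices and all function names below are hypothetical.

```python
import numpy as np


def fit_transform_trajectory(snrs, transforms, degree=2):
    """Fit one polynomial per transform element (assumed least-squares fit).

    snrs:       sequence of N scalar noise descriptors (e.g. SNR in dB)
    transforms: sequence of N affine transforms, each of shape (d, d + 1),
                laid out as [A | b]
    Returns coefficients of shape (degree + 1, d, d + 1), lowest power first.
    """
    snrs = np.asarray(snrs, dtype=float)
    stacked = np.stack([np.asarray(t, dtype=float) for t in transforms])
    # Vandermonde matrix: row n holds [1, v_n, v_n^2, ...]
    vander = np.vander(snrs, degree + 1, increasing=True)
    flat = stacked.reshape(len(snrs), -1)
    coeffs, *_ = np.linalg.lstsq(vander, flat, rcond=None)
    return coeffs.reshape((degree + 1,) + stacked.shape[1:])


def transform_at(coeffs, snr):
    """Evaluate the polynomial trajectory W(snr) = sum_p coeffs[p] * snr**p."""
    powers = float(snr) ** np.arange(coeffs.shape[0])
    return np.tensordot(powers, coeffs, axes=1)


def apply_transform(W, x):
    """Apply the affine transform [A | b] to one feature vector x."""
    A, b = W[:, :-1], W[:, -1]
    return A @ x + b
```

Under these assumptions, a transform for an unseen noise condition is obtained with `transform_at(coeffs, snr)` and applied once per frame with `apply_transform`, so the per-frame cost does not depend on the number of Gaussian components. That is one plausible reading of why a feature space variant could be cheaper than updating model parameters.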
Related resources
Efficient use of DNN bottleneck features in generalized variable parameter HMMs for noise robust speech recognition
Recently a new approach to incorporate deep neural networks (DNN) bottleneck features into HMM based acoustic models using generalized variable parameter HMMs (GVPHMMs) was proposed. As Gaussian component level polynomial interpolation is performed for each high dimensional DNN bottleneck feature vector at a frame level, conventional GVPHMMs are computationally expensive to use in recognition t...
Generalized Variable Parameter HMMs for Noise Robust Speech Recognition
Handling variable ambient noise is a challenging task for automatic speech recognition (ASR) systems. To address this issue, multi-style, noise condition independent (CI) model training using speech data collected in diverse noise environments, or uncertainty decoding techniques can be used. An alternative approach is to explicitly approximate the continuous trajectory of Gaussian component mea...
Noise-robust ASR by Using Disti Approximated with Logarithmic No
Various approaches focused on noise robustness have been investigated with the aim of using an automatic speech recognition (ASR) system in practical environments. We have previously proposed a distinctive phonetic feature (DPF) parameter set for a noise-robust ASR system, which reduced the effect of high-level additive noise [1]. This paper describes an attempt to replace normal distributions (...
Deep neural network bottleneck features for generalized variable parameter HMMs
Recently deep neural networks (DNNs) have become increasingly popular for acoustic modelling in automatic speech recognition (ASR) systems. As the bottleneck features they produce are inherently discriminative and contain rich hidden factors that influence the surface acoustic realization, the standard approach is to augment the conventional acoustic features with the bottleneck features in a t...
A new method for robust speech recognition based on missing data using a bidirectional neural network
The performance of speech recognition systems is greatly reduced when speech is corrupted by noise. One common approach to robust speech recognition is the missing feature method. In this approach, the components of the time-frequency representation of the signal (spectrogram) that have a low signal-to-noise ratio (SNR) are tagged as missing and deleted, then replaced using the remaining components and statistical ...
Publication date: 2013